Ranked set sampling is a sampling design which has a wide range of applications in industrial statistics,
and environmental and ecological studies, etc. It is well known that ranked set samples provide more Fisher
information than simple random samples of the same size about the unknown parameters of the underlying 
distribution in parametric inferences. In this paper, we consider the uncertainty and information content of 
ranked set samples in both perfect and imperfect ranking scenarios in terms of Shannon entropy, Renyi and 
Kullback-Leibler (KL) information measures. It is proved that under these information measures, ranked 
set sampling design performs better than its simple random sampling counterpart of the same size. The 
information content is also a monotone function of the set size in ranked set sampling. Moreover, the effect 
of ranking error on the information content of the data is investigated. 
1. Introduction and Preliminaries



During the past few years, ranked set sampling has emerged as a powerful tool in statistical inference, and
it is now regarded as a serious alternative to the commonly used simple random sampling design. Ranked 
set sampling and some of its variants have been applied successfully in different areas of applications such as 
industrial statistics, environmental and ecological studies, biostatistics and statistical genetics. The feature 
of ranked set sampling is that it combines simple random sampling with other sources of information such 
as professional knowledge, auxiliary information, judgement, etc., which are assumed to be inexpensive and 
easily obtained. This extra information helps to increase the chance that the collected sample yields more 
representative measurements (i.e., measurements that span the range of the value of the variable of interest in 
the underlying population). In its original form, ranked set sampling involves randomly drawing k units (called 
a set of size k) from the underlying population for which an estimate of the unknown parameter of interest
is required. The units of this set are ranked by means of an auxiliary variable or some other ranking process 
such as judgmental ranking. For this ranked set, the unit ranked lowest is chosen for actual measurement of 
the variable of interest. A second set of size k is then drawn and ranking carried out. The unit in the second 
lowest position is chosen and the variable of interest for this unit is quantified. Sampling is continued until, 
from the kth set, the kth ranked unit is measured. This entire process may be repeated m times (or cycles) to
obtain a ranked set sample of size n = mk from the underlying population. 
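
The sampling scheme described above can be sketched in a few lines of Python. This is only an illustrative sketch: the function name `ranked_set_sample`, the callable `draw` (which returns one unit from the population) and the optional `rank_key` (the auxiliary value used for ranking) are hypothetical names introduced here, and ranking by the measured value itself is assumed to stand in for perfect ranking.

```python
import random

def ranked_set_sample(draw, k, m, rng, rank_key=None):
    """Return a ranked set sample of size n = m * k.

    draw(rng) -> one unit from the population (hypothetical callable).
    rank_key  -> value used to rank a set; the identity mimics perfect ranking.
    """
    if rank_key is None:
        rank_key = lambda unit: unit
    sample = []
    for _ in range(m):                                # m cycles
        for i in range(k):                            # i-th set of the cycle
            units = [draw(rng) for _ in range(k)]     # a fresh set of k units
            units.sort(key=rank_key)                  # rank the set
            sample.append(units[i])                   # measure only the unit ranked i + 1
    return sample

rng = random.Random(0)
rss = ranked_set_sample(lambda r: r.expovariate(1.0), k=3, m=4, rng=rng)
print(len(rss))  # n = m * k measured units
```

Note that each measured unit costs k draws, so n = mk measurements use mk^2 sampled units; passing a noisy `rank_key` would be one way to mimic imperfect ranking.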

Let $\mathbf{X}_{SRS} = \{X_i, i = 1, \ldots, n\}$ be a simple random sample (SRS) of size $n \geq 1$ from a continuous distribution
with probability density function (pdf) $f(x)$. Let $F(x)$ denote the cumulative distribution function (cdf)
of the random variable $X$ and define $\bar{F}(x) = 1 - F(x)$ as the survival function of $X$ with support $S_X$. Also
assume that $\mathbf{X}_{RSS} = \{X_{(i)j}, i = 1, \ldots, k, j = 1, \ldots, m\}$ denotes a ranked set sample (RSS) of size $n = mk$ from
$f(x)$ where $k$ is the set size and $m$ is the cycle size. Here $X_{(i)j}$ is the $i$th order statistic in a set of size $k$
obtained in cycle $j$ with pdf

\[ f_{(i)}(x) = \frac{k!}{(i-1)!\,(k-i)!}\, F^{i-1}(x)\, \bar{F}^{k-i}(x)\, f(x), \quad x \in S_X. \tag{1} \]

When ranking is imperfect we use $\mathbf{X}^{*}_{RSS} = \{X_{[i]j}, i = 1, \ldots, k, j = 1, \ldots, m\}$ to denote an imperfect RSS of size
$n = mk$ from $f(x)$. We also use $f_{[i]}(x)$ to show the pdf of the judgemental order statistic $X_{[i]}$ which is given
by

\[ f_{[i]}(x) = \sum_{r=1}^{k} p_{i,r}\, f_{(r)}(x), \]

where $p_{i,r} = P(X_{[i]} = X_{(r)})$ denotes the probability with which the $r$th order statistic is judged as having rank
$i$ with $\sum_{i=1}^{k} p_{i,r} = \sum_{r=1}^{k} p_{i,r} = 1$. Readers are referred to Wolfe (2004, 2010), Chen et al. (2004) and references
therein for further details.

The Fisher information plays a central role in statistical inference and information theoretic studies. It is well 
known that RSS provides more Fisher information than SRS of the same size about the unknown parameters 
of the underlying distribution in parametric inferences (e.g., Chen, 2000, Chapter 3). Park and Lim (2012) 
studied the effect of imperfect rankings on the amount of Fisher information in ranked set samples. Frey 
(2013) showed by example that the Fisher information in an imperfect ranked set sample may be higher than 
the Fisher information in a perfect ranked-set sample. The concept of information is so rich that there is no 
single definition that will be able to quantify the information content of a sample properly. For example, from 
an engineering perspective, the Shannon entropy or the Renyi information might be more suitable to be used 
as measures to quantify the information content of a sample than the Fisher information. In this paper, we 
study the notions of uncertainty and information content of RSS data in both perfect and imperfect ranking 
scenarios under the Shannon entropy, Renyi and Kullback-Leibler (KL) information measures and compare 
them with their counterparts with SRS data. These measures are increasingly being used in various contexts
such as order statistics by Wong and Chen (1990), Park (1995), Ebrahimi et al. (2004) and Baratpour et al.
(2007a, b), censored data by Abo-Eleneen (2011), record data and reliability and life testing by Raqab
and Awad (2000, 2001), Zahedi and Shakil (2006) and Ahmadi and Fashandi (2008), and hypothesis testing by
Park (2005), Balakrishnan et al. (2007) and Habibi Rad et al. (2011). So, it would be of interest to use these
measures to calculate the information content of RSS data and compare them with their counterparts with
SRS data.

To this end, in Section 2 we obtain the Shannon entropies of RSS and SRS data of the same size. We show
that the difference between the Shannon entropy of $\mathbf{X}_{RSS}$ and $\mathbf{X}_{SRS}$ is distribution-free and it is a monotone
function of the set size in ranked set sampling. In Section 3 similar results are obtained under the Renyi
information. Section 4 is devoted to the Kullback-Leibler information of RSS data and its comparison with
its counterpart under SRS data. We show that the Kullback-Leibler information between the distribution of
$\mathbf{X}_{SRS}$ and the distribution of $\mathbf{X}_{RSS}$ is distribution-free and increases as the set size increases. Finally, in Section
5 we provide some concluding remarks.



2. Shannon Entropy of Ranked Set Samples 

The Shannon entropy or simply the entropy of a continuous random variable X is defined by 

\[ H(X) = -\int f(x) \log f(x)\, dx, \tag{2} \]

provided the integral exists. The Shannon entropy is extensively used in the literature as a quantitative 
measure of uncertainty associated with a random phenomenon. The development of the idea of the entropy
by Shannon (1948) initiated a separate branch of learning named the "Theory of Information". The Shannon 
entropy provides an excellent tool to quantify the amount of information (or uncertainty) contained in a sample 
regarding its parent distribution. Indeed, the amount of information which we get when we observe the result 
on a random experiment can be taken to be equal to the amount of uncertainty concerning the outcome of 
the experiment before carrying it out. In practice, smaller values of the Shannon entropy are more desirable. 
We refer the reader to Cover and Thomas (1991) and references therein for more details. In this section, we
compare the Shannon entropy of SRS data with its counterparts under both perfect and imperfect RSS data
of the same size. Without loss of generality, we take $m = 1$ throughout the paper. From (2), the Shannon
entropy of $\mathbf{X}_{SRS}$ is given by

\[ H(\mathbf{X}_{SRS}) = -\sum_{i=1}^{n} \int f(x_i) \log f(x_i)\, dx_i = n H(X_1). \]

Under the perfect ranking assumption, it is easy to see that

\[ H(\mathbf{X}_{RSS}) = -\sum_{i=1}^{n} \int f_{(i)}(x) \log f_{(i)}(x)\, dx = \sum_{i=1}^{n} H(X_{(i)}), \tag{3} \]

where $H(X_{(i)})$ is the entropy of the $i$th order statistic in a sample of size $n$. Ebrahimi et al. (2004) explored
some properties of the Shannon entropy of the usual order statistics (see also, Park, 1995; Wong and Chen,
1990). Using (1) and the transformation $X_{(i)} = F^{-1}(U_{(i)})$ it is easy to prove the following representation for
the Shannon entropy of order statistics (see, Ebrahimi et al. 2004, page 177):

\[ H(X_{(i)}) = H(U_{(i)}) - E\left[\log f(F^{-1}(W_i))\right], \tag{4} \]

where $W_i$ has the beta distribution with parameters $i$ and $n - i + 1$ and $U_{(i)}$ stands for the $i$th order statistic
of a random sample of size $n$ from the Uniform(0, 1) distribution.

In the following result, we show that the Shannon entropy of RSS data is smaller than its SRS counterpart
when ranking is perfect.

Lemma 1. $H(\mathbf{X}_{RSS}) \leq H(\mathbf{X}_{SRS})$ for all set size $n \in \mathbb{N}$ and the equality holds when $n = 1$.

Proof. To show the result we use the fact that $f(x) = \frac{1}{n} \sum_{i=1}^{n} f_{(i)}(x)$ (see Chen et al., 2004). Using the
convexity of $g(t) = t \log t$ as a function of $t > 0$, we have

\[ \frac{1}{n} \sum_{i=1}^{n} f_{(i)}(x) \log f_{(i)}(x) \geq \left( \frac{1}{n} \sum_{i=1}^{n} f_{(i)}(x) \right) \log \left( \frac{1}{n} \sum_{i=1}^{n} f_{(i)}(x) \right) = f(x) \log f(x). \tag{5} \]

Now, the result follows by the use of (3) and (5). □

In the sequel, we quantify the difference between $H(\mathbf{X}_{RSS})$ and $H(\mathbf{X}_{SRS})$. To this end, by (4), we first get

\[ H(\mathbf{X}_{RSS}) = \sum_{i=1}^{n} H(U_{(i)}) - \int \sum_{i=1}^{n} f_{(i)}(x) \log f(x)\, dx = \sum_{i=1}^{n} H(U_{(i)}) + H(\mathbf{X}_{SRS}). \]

Note that since $H(\mathbf{X}_{RSS}) \leq H(\mathbf{X}_{SRS})$ we must have $\sum_{i=1}^{n} H(U_{(i)}) \leq 0$ for all $n \in \mathbb{N}$. Also, $H(\mathbf{X}_{RSS}) - H(\mathbf{X}_{SRS}) = \sum_{i=1}^{n} H(U_{(i)})$
is distribution-free (it does not depend on the parent distribution). Ebrahimi et al.
(2004) obtained an expression for $H(U_{(i)})$ which is given by

\[ H(U_{(i)}) = \log B(i, n-i+1) - (i-1)\left[\psi(i) - \psi(n+1)\right] - (n-i)\left[\psi(n-i+1) - \psi(n+1)\right], \]

where $\psi(z) = \frac{d}{dz} \log \Gamma(z)$ is the digamma function and $B(a, b)$ stands for the complete beta function. Hence,
we have

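\[ H(\mathbf{X}_{RSS}) - H(\mathbf{X}_{SRS}) = \sum_{j=1}^{n-1} (n - 2j) \log j - n \log n - 2 \sum_{i=1}^{n} (i-1) \psi(i) + n(n-1) \psi(n+1) = k(n), \text{ say.} \]

By noting that $\psi(n+1) = \psi(n) + 1/n$, for $n \geq 2$, we can easily find the following recursive formula for
calculating $k(n)$:

\[ k(n+1) = k(n) + n + \log \Gamma(n+1) - (n+1) \log(n+1). \]

For instance, starting from $k(1) = 0$ this gives $k(2) = 1 - 2 \log 2 \approx -0.386$, in agreement with Table 1.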

Table 1 shows the numerical values of $H(\mathbf{X}_{RSS}) - H(\mathbf{X}_{SRS})$ for $n \in \{2, \ldots, 10\}$. From Table 1 it is observed
that the difference between the Shannon entropy of RSS data and its SRS counterpart is negative and grows in
magnitude as the set size increases. Intuitively, this can be explained by the fact that ranked set sampling provides more
structure to the observed data than simple random sampling and the amount of the uncertainty in the more
structured RSS data set is less than that of SRS.
Table 1: The numerical values of k(n) for n = 2 up to 10.

n        2        3        4        5        6        7        8        9        10
k(n)    -0.386   -0.989   -1.742   -2.611   -3.574   -4.616   -5.727   -6.897   -8.121
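
The values in Table 1 can be recomputed directly. The following is a minimal sketch using only the Python standard library; the function names are illustrative, the digamma function is evaluated at integers through $\psi(n) = -\gamma + \sum_{j=1}^{n-1} 1/j$, and the recursion is used with the term $\log \Gamma(n+1) = \log n!$, which is the form that reproduces the tabulated values.

```python
import math

EULER_GAMMA = 0.5772156649015329

def digamma_int(n):
    """psi(n) for a positive integer n: -gamma + sum_{j=1}^{n-1} 1/j."""
    return -EULER_GAMMA + sum(1.0 / j for j in range(1, n))

def uniform_order_stat_entropy(i, n):
    """H(U_(i)): Shannon entropy of a Beta(i, n - i + 1) variable."""
    log_beta = math.lgamma(i) + math.lgamma(n - i + 1) - math.lgamma(n + 1)
    return (log_beta
            - (i - 1) * (digamma_int(i) - digamma_int(n + 1))
            - (n - i) * (digamma_int(n - i + 1) - digamma_int(n + 1)))

def k_direct(n):
    """k(n) as the sum of the entropies of the uniform order statistics."""
    return sum(uniform_order_stat_entropy(i, n) for i in range(1, n + 1))

def k_recursive(n):
    """k(n) from k(j + 1) = k(j) + j + log(j!) - (j + 1) log(j + 1), k(1) = 0."""
    value = 0.0
    for j in range(1, n):
        value += j + math.lgamma(j + 1) - (j + 1) * math.log(j + 1)
    return value

for n in range(2, 11):
    print(n, round(k_direct(n), 4), round(k_recursive(n), 4))
```

Both routes agree to floating-point accuracy, and they match the tabulated values to within the last printed digit (the table appears to truncate rather than round).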



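Now, assume that $\mathbf{X}^{*}_{RSS} = \{X_{[i]}, i = 1, \ldots, n\}$ is an imperfect RSS of size $n$ from $f(x)$. Similar to the perfect
RSS we can easily show that

\[ H(\mathbf{X}^{*}_{RSS}) = \sum_{i=1}^{n} H(X_{[i]}), \tag{6} \]

where we assume that the cycle size is equal to one and $k = n$. Also $H(X_{[i]}) = -\int f_{[i]}(x) \log f_{[i]}(x)\, dx$, or
equivalently

\[ H(X_{[i]}) = -\int \left( \sum_{r=1}^{n} p_{i,r} f_{(r)}(x) \right) \log \left( \sum_{r=1}^{n} p_{i,r} f_{(r)}(x) \right) dx. \]

Again, using the convexity of $g(t) = t \log t$ and the equalities $\sum_{i=1}^{n} p_{i,r} = \sum_{r=1}^{n} p_{i,r} = 1$, we find

\[ \begin{aligned}
H(\mathbf{X}^{*}_{RSS}) &= -\sum_{i=1}^{n} \int f_{[i]}(x) \log f_{[i]}(x)\, dx \\
&\leq -n \int \left( \frac{1}{n} \sum_{r=1}^{n} \sum_{i=1}^{n} p_{i,r} f_{(r)}(x) \right) \log \left( \frac{1}{n} \sum_{r=1}^{n} \sum_{i=1}^{n} p_{i,r} f_{(r)}(x) \right) dx \\
&= -n \int f(x) \log f(x)\, dx \\
&= H(\mathbf{X}_{SRS}).
\end{aligned} \]

So, we have the following result.

Lemma 2. $H(\mathbf{X}^{*}_{RSS}) \leq H(\mathbf{X}_{SRS})$ for all set size $n \in \mathbb{N}$ and the equality holds when the ranking is done
randomly and $p_{i,r} = \frac{1}{n}$, for all $i, r \in \{1, \ldots, n\}$.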

In the following result we compare the Shannon entropies of perfect and imperfect RSS data. We observe that
the Shannon entropy of $\mathbf{X}_{RSS}$ is less than the Shannon entropy of $\mathbf{X}^{*}_{RSS}$.

Lemma 3. $H(\mathbf{X}_{RSS}) \leq H(\mathbf{X}^{*}_{RSS})$ for all set size $n \in \mathbb{N}$ and the equality happens when the ranking is perfect.

Proof. Using the inequality $f_{[i]}(x) \log f_{[i]}(x) \leq \sum_{r=1}^{n} p_{i,r} f_{(r)}(x) \log f_{(r)}(x)$, we have

\[ H(X_{[i]}) \geq -\sum_{r=1}^{n} p_{i,r} \int f_{(r)}(x) \log f_{(r)}(x)\, dx = \sum_{r=1}^{n} p_{i,r} H(X_{(r)}). \]

Now, the result follows from (6) upon changing the order of summations and using $\sum_{i=1}^{n} p_{i,r} = 1$. □

Summing up, we find the following ordering relationship among the Shannon entropies of $\mathbf{X}^{*}_{RSS}$, $\mathbf{X}_{RSS}$ and
$\mathbf{X}_{SRS}$:

\[ H(\mathbf{X}_{RSS}) \leq H(\mathbf{X}^{*}_{RSS}) \leq H(\mathbf{X}_{SRS}). \]



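Example 1. Suppose $X$ has an exponential distribution with pdf $f(x) = \lambda e^{-\lambda x}$, $x > 0$, where $\lambda > 0$ is the
unknown parameter of interest. We consider the case where $n = 2$. For an imperfect RSS of size $n = 2$, we
use the ranking error probability matrix

\[ P = \begin{pmatrix} p_{1,1} & p_{1,2} \\ p_{2,1} & p_{2,2} \end{pmatrix}. \]

Using (1), we have $f_{[i]}(x) = 2 \lambda e^{-\lambda x} \left\{ (1 - 2 p_{i,2}) e^{-\lambda x} + p_{i,2} \right\}$, $i = 1, 2$. Straightforward calculations show that
$H(\mathbf{X}_{SRS}) = 2 - 2 \log \lambda$, $H(\mathbf{X}_{RSS}) = 3 - 2 \log(2\lambda)$, and

\[ H(\mathbf{X}^{*}_{RSS}) = 2 - 2 \log(2\lambda) + (p_{2,2} - p_{1,1}) + \eta(p_{1,1}) + \eta(p_{2,2}), \]

where

\[ \eta(a) = \frac{2}{2a - 1} \int_{a}^{1-a} u \log u\, du = \frac{1}{2} + \frac{1}{2a - 1} \left[ (1-a)^2 \log(1-a) - a^2 \log a \right], \]

with $0 < a < 1$. Since the rows and columns of $P$ sum to one, $p_{2,2} = p_{1,1} = 1 - p_{1,2}$, and $\eta(a) = \eta(1 - a)$. It is then easy to show that

\[ H(\mathbf{X}_{RSS}) - H(\mathbf{X}_{SRS}) = 1 - 2 \log 2 \approx -0.3863 < 0, \]

\[ H(\mathbf{X}^{*}_{RSS}) - H(\mathbf{X}_{SRS}) = \eta(p_{1,2}) + \eta(1 - p_{1,2}) - 2 \log 2 = 2 \eta(p_{1,2}) - 2 \log 2 \leq 0, \]

\[ H(\mathbf{X}_{RSS}) - H(\mathbf{X}^{*}_{RSS}) = 1 - \eta(p_{1,2}) - \eta(1 - p_{1,2}) = 1 - 2 \eta(p_{1,2}) \leq 0. \]

Figure 1 shows the differences between $H(\mathbf{X}^{*}_{RSS})$ and $H(\mathbf{X}_{RSS})$ with $H(\mathbf{X}_{SRS})$. It also presents the effect of
ranking error on the amount of the Shannon entropy of the resulting RSS data by comparing $H(\mathbf{X}^{*}_{RSS})$ with
$H(\mathbf{X}_{RSS})$. It is observed that the maximum difference occurs for $p_{1,2} = 0.5$.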
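
The entropy gaps of Example 1 are easy to evaluate numerically. The sketch below is illustrative only: the helper names `eta` and `entropy_gaps` are introduced here, $\eta$ is taken as $\eta(a) = \frac{1}{2} + \frac{(1-a)^2 \log(1-a) - a^2 \log a}{2a - 1}$ (the normalisation for which $\eta(0) = \eta(1) = 1/2$ recovers the perfect-ranking case), and its value at $a = 1/2$ is set to the limit $\log 2$.

```python
import math

def eta(a):
    """eta(a) = 2/(2a - 1) * integral_a^{1-a} u log u du, extended by continuity."""
    if abs(a - 0.5) < 1e-9:
        return math.log(2.0)              # limiting value at a = 1/2
    def sq_log(u):                        # u^2 log u, with 0 * log 0 taken as 0
        return 0.0 if u == 0.0 else u * u * math.log(u)
    return 0.5 + (sq_log(1.0 - a) - sq_log(a)) / (2.0 * a - 1.0)

def entropy_gaps(p12):
    """Return the three entropy differences of Example 1 for a given p_{1,2}."""
    rss_vs_srs = 1.0 - 2.0 * math.log(2.0)
    imperfect_vs_srs = 2.0 * eta(p12) - 2.0 * math.log(2.0)
    rss_vs_imperfect = 1.0 - 2.0 * eta(p12)
    return rss_vs_srs, imperfect_vs_srs, rss_vs_imperfect

for p12 in (0.0, 0.1, 0.25, 0.5):
    print(p12, [round(g, 4) for g in entropy_gaps(p12)])
```

At $p_{1,2} = 0$ the imperfect sample coincides with the perfect one, while at $p_{1,2} = 0.5$ ranking is purely random and the gap to the SRS entropy vanishes.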



[Figure 1 omitted; horizontal axis: $p_{1,2}$, from 0.0 to 1.0.]

Figure 1: Computed values of the difference between the Shannon entropies of $\mathbf{X}_{RSS}$ and $\mathbf{X}^{*}_{RSS}$ compared with that of $\mathbf{X}_{SRS}$ of
the same size as a function of the ranking error probability $p_{1,2}$.
3. Renyi Information of Ranked Set Samples

A more general measure of entropy with the same meaning and similar properties as that of Shannon entropy
has been defined by Renyi (1961) as follows

\[ H_\alpha(X) = \frac{1}{1-\alpha} \log \int f^{\alpha}(x)\, d\nu(x) = \frac{1}{1-\alpha} \log E\left[ f^{\alpha-1}(X) \right], \tag{7} \]

where $\alpha > 0$, $\alpha \neq 1$ and $d\nu(x) = dx$ for the continuous and $d\nu(x) = 1$ for discrete cases. It is well known that

\[ \lim_{\alpha \to 1} H_\alpha(X) = -\int f(x) \log f(x)\, dx = H(X). \]

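Renyi information is much more flexible than the Shannon entropy due to the parameter $\alpha$. It is an important
measure in various applied sciences such as statistics, ecology, engineering, economics, etc. In this section, we
obtain the Renyi information of $\mathbf{X}_{RSS}$ and $\mathbf{X}^{*}_{RSS}$ and compare them with the Renyi information of $\mathbf{X}_{SRS}$. To
this end, from (7), it is easy to show that the Renyi information of a SRS of size $n$ from $f$ is given by

\[ H_\alpha(\mathbf{X}_{SRS}) = \sum_{i=1}^{n} H_\alpha(X_i) = n H_\alpha(X_1). \tag{8} \]

Also, for a RSS of size $n$, we have

\[ H_\alpha(\mathbf{X}_{RSS}) = \sum_{i=1}^{n} H_\alpha(X_{(i)}). \tag{9} \]

To compare $H_\alpha(\mathbf{X}_{SRS})$ with $H_\alpha(\mathbf{X}_{RSS})$ and $H_\alpha(\mathbf{X}^{*}_{RSS})$, we consider two cases, i.e. $0 < \alpha < 1$ and $\alpha > 1$.
First, we find the results for $0 < \alpha < 1$ which are stated in the next lemma.

Lemma 4. For any $0 < \alpha < 1$ and all $n \in \mathbb{N}$, we have

\[ H_\alpha(\mathbf{X}_{RSS}) \leq H_\alpha(\mathbf{X}^{*}_{RSS}) \leq H_\alpha(\mathbf{X}_{SRS}). \]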

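Proof. We first show that for any $0 < \alpha < 1$, $H_\alpha(\mathbf{X}_{RSS}) \leq H_\alpha(\mathbf{X}^{*}_{RSS})$. To this end, using

\[ H_\alpha(\mathbf{X}^{*}_{RSS}) = \frac{1}{1-\alpha} \sum_{i=1}^{n} \log \int \left( \sum_{j=1}^{n} p_{i,j} f_{(j)}(x) \right)^{\alpha} dx \tag{10} \]

and concavity of $h_1(t) = t^{\alpha}$, for $0 < \alpha < 1$, $t > 0$, we have

\[ \begin{aligned}
H_\alpha(\mathbf{X}^{*}_{RSS}) &\geq \frac{1}{1-\alpha} \sum_{i=1}^{n} \log \sum_{j=1}^{n} p_{i,j} \int f_{(j)}^{\alpha}(x)\, dx \\
&\geq \frac{1}{1-\alpha} \sum_{i=1}^{n} \sum_{j=1}^{n} p_{i,j} \log \int f_{(j)}^{\alpha}(x)\, dx \\
&= \frac{1}{1-\alpha} \sum_{j=1}^{n} \log \int f_{(j)}^{\alpha}(x)\, dx = \sum_{j=1}^{n} H_\alpha(X_{(j)}),
\end{aligned} \]

where the second inequality is obtained by using the concavity of $h_2(t) = \log t$, for $t > 0$. This, with (9), shows
the result. To complete the proof we show that $H_\alpha(\mathbf{X}^{*}_{RSS}) \leq H_\alpha(\mathbf{X}_{SRS})$ for any $0 < \alpha < 1$ and all $n \in \mathbb{N}$. To
this end, from (10), and using $f(x) = \frac{1}{n} \sum_{i=1}^{n} f_{[i]}(x)$, we have

\[ \begin{aligned}
H_\alpha(\mathbf{X}^{*}_{RSS}) &= \frac{1}{1-\alpha} \sum_{i=1}^{n} \log \int f_{[i]}^{\alpha}(x)\, dx \\
&\leq \frac{n}{1-\alpha} \log \int \frac{1}{n} \sum_{i=1}^{n} f_{[i]}^{\alpha}(x)\, dx \\
&\leq \frac{n}{1-\alpha} \log \int f^{\alpha}(x)\, dx = H_\alpha(\mathbf{X}_{SRS}),
\end{aligned} \]

where the first inequality uses the concavity of $\log t$ and the second the concavity of $t^{\alpha}$. □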



In Lemma 4 we were able to show analytically an ordering relationship among the Renyi information of $\mathbf{X}^{*}_{RSS}$,
$\mathbf{X}_{RSS}$ and $\mathbf{X}_{SRS}$ when $0 < \alpha < 1$. It would naturally be of interest to extend such a relationship to the case
where $\alpha > 1$. It appears that a similar relationship as in Lemma 4 holds when $\alpha > 1$. However, we do not have an
analytical proof here.

Conjecture 1. For any $\alpha > 1$ and all $n \in \mathbb{N}$, we have $H_\alpha(\mathbf{X}_{RSS}) \leq H_\alpha(\mathbf{X}^{*}_{RSS}) \leq H_\alpha(\mathbf{X}_{SRS})$.

In Example 2 we compare the Renyi information of $\mathbf{X}^{*}_{RSS}$, $\mathbf{X}_{RSS}$ and $\mathbf{X}_{SRS}$ as a function of $\alpha$ in the case of
an exponential distribution. The results are presented in Figure 2, and they support Conjecture 1.

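Example 2. Suppose the assumptions of Example 1 hold, then the Renyi information of a SRS of size $n = 2$
is given by

\[ H_\alpha(\mathbf{X}_{SRS}) = -2 \log \lambda - \frac{2}{1-\alpha} \log \alpha, \quad \alpha \neq 1. \]

Straightforward calculations show that

\[ H_\alpha(X_{(1)}) = -\log \lambda - \log 2 - \frac{1}{1-\alpha} \log \alpha, \]

\[ H_\alpha(X_{(2)}) = -\log \lambda + \frac{\alpha}{1-\alpha} \log 2 + \frac{1}{1-\alpha} \log \left\{ \frac{\Gamma(\alpha) \Gamma(\alpha+1)}{\Gamma(2\alpha+1)} \right\}, \]

and so the Renyi information of $\mathbf{X}_{RSS}$ is given by $H_\alpha(\mathbf{X}_{RSS}) = H_\alpha(X_{(1)}) + H_\alpha(X_{(2)})$. Now,

\[ H_\alpha(\mathbf{X}_{RSS}) - H_\alpha(\mathbf{X}_{SRS}) = \frac{2\alpha - 1}{1-\alpha} \log 2 + \frac{1}{1-\alpha} \log \left\{ \frac{\alpha\, \Gamma(\alpha) \Gamma(\alpha+1)}{\Gamma(2\alpha+1)} \right\}. \]

To obtain $H_\alpha(\mathbf{X}^{*}_{RSS})$, let

\[ U_{i,\lambda}(x, t) = a_i(t) e^{-\lambda x} + b_i(t), \quad i = 1, 2, \]

where

\[ a_i(t) = (-1)^{i} (1 - 2t) \quad \text{and} \quad b_i(t) = t^{\,i-1} (1-t)^{\,2-i}, \]

and $p_{1,1} = P(X_{[1]} = X_{(1)})$ is defined in Example 1. Now, the Renyi information of $\mathbf{X}^{*}_{RSS}$ is obtained as follows

\[ H_\alpha(\mathbf{X}^{*}_{RSS}) = \frac{2\alpha}{1-\alpha} \log 2\lambda + \frac{1}{1-\alpha} \sum_{i=1}^{2} \log \int_{0}^{\infty} e^{-\alpha \lambda x}\, U_{i,\lambda}^{\alpha}(x, p_{1,1})\, dx, \quad \alpha \neq 1, \]

which can be calculated numerically. Figure 2(a) shows the values of $H_\alpha(\mathbf{X}^{*}_{RSS}) - H_\alpha(\mathbf{X}_{SRS})$ as a function of
$\alpha$ for $p_{1,1} \in \{0.8, 0.9, 0.95, 1\}$. When $p_{1,1} = 1$, $H_\alpha(\mathbf{X}^{*}_{RSS}) - H_\alpha(\mathbf{X}_{SRS}) = H_\alpha(\mathbf{X}_{RSS}) - H_\alpha(\mathbf{X}_{SRS})$. In Figure
2(b) we show the effect of the ranking error on the Renyi information of $\mathbf{X}_{RSS}$ by comparing $H_\alpha(\mathbf{X}^{*}_{RSS})$ and
$H_\alpha(\mathbf{X}_{RSS})$ as functions of $\alpha$ for different values of $p_{1,1}$.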
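
The numerical evaluation mentioned in Example 2 can be sketched as follows. This is an illustration under stated assumptions rather than the authors' computation: the function names are introduced here, the integral over $(0, \infty)$ is mapped to $(0, 1)$ by the substitution $w = e^{-\alpha \lambda x}$, and a plain midpoint rule with a fixed number of steps is used for the quadrature. The coefficients are taken as $a_i(t) = (-1)^i (1 - 2t)$ and $b_i(t) = t^{i-1}(1-t)^{2-i}$, which follow from the mixture form of $f_{[i]}$ in Example 1.

```python
import math

def renyi_srs(alpha, lam=1.0):
    """H_alpha(X_SRS) for n = 2 observations from Exp(lam)."""
    return -2.0 * math.log(lam) - 2.0 * math.log(alpha) / (1.0 - alpha)

def renyi_imperfect_rss(alpha, p11, lam=1.0, steps=20000):
    """H_alpha(X*_RSS) for n = 2; p11 = 1 corresponds to perfect ranking."""
    total = 2.0 * alpha / (1.0 - alpha) * math.log(2.0 * lam)
    h = 1.0 / steps
    for i in (1, 2):
        a = (-1) ** i * (1.0 - 2.0 * p11)
        b = p11 ** (i - 1) * (1.0 - p11) ** (2 - i)
        # With w = exp(-alpha * lam * x), the integral over (0, inf) becomes
        # (1 / (alpha * lam)) * integral_0^1 (a * w**(1/alpha) + b)**alpha dw.
        acc = sum((a * ((s + 0.5) * h) ** (1.0 / alpha) + b) ** alpha
                  for s in range(steps))
        total += math.log(acc * h / (alpha * lam)) / (1.0 - alpha)
    return total

def perfect_gap_closed_form(alpha):
    """H_alpha(X_RSS) - H_alpha(X_SRS) for n = 2 under perfect ranking."""
    log_ratio = 2.0 * math.lgamma(alpha + 1.0) - math.lgamma(2.0 * alpha + 1.0)
    return ((2.0 * alpha - 1.0) * math.log(2.0) + log_ratio) / (1.0 - alpha)

for alpha in (0.5, 2.0):
    for p11 in (1.0, 0.95, 0.9, 0.8):
        gap = renyi_imperfect_rss(alpha, p11) - renyi_srs(alpha)
        print(alpha, p11, round(gap, 4))
```

The differences do not depend on $\lambda$, and with $p_{1,1} = 1$ the numerical gap agrees with the closed-form expression for perfect ranking.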



[Figure 2 omitted; panels (a) and (b), each with curves for $p_{1,1} = 1$, $0.95$, $0.9$ and $0.8$.]

Figure 2: Comparison of the Renyi information of $\mathbf{X}_{RSS}$ and $\mathbf{X}^{*}_{RSS}$ with that of $\mathbf{X}_{SRS}$ as a function of $\alpha$. The values of
$H_\alpha(\mathbf{X}^{*}_{RSS}) - H_\alpha(\mathbf{X}_{SRS})$ are presented in (a) while $H_\alpha(\mathbf{X}^{*}_{RSS}) - H_\alpha(\mathbf{X}_{RSS})$ are given in (b).



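Note that for $\alpha > 1$ the difference between the Renyi information of $\mathbf{X}_{RSS}$ and its counterpart under SRS
can be written as follows

\[ \begin{aligned}
H_\alpha(\mathbf{X}_{RSS}) - H_\alpha(\mathbf{X}_{SRS}) &= \frac{1}{1-\alpha} \sum_{i=1}^{n} \left\{ \log \int f_{(i)}^{\alpha}(x)\, dx - \log \int f^{\alpha}(x)\, dx \right\} \\
&= \frac{1}{1-\alpha} \sum_{i=1}^{n} \log \frac{\int f_{(i)}^{\alpha}(x)\, dx}{\int f^{\alpha}(x)\, dx} \\
&= \frac{\alpha n}{1-\alpha} \log n + \frac{1}{1-\alpha} \sum_{i=1}^{n} \log E\left[ \left\{ P_{F(W)}(T = i-1) \right\}^{\alpha} \right],
\end{aligned} \]

where $T \mid W = w \sim \mathrm{Bin}(n-1, F(w))$ and $W$ has a density proportional to $f^{\alpha}(w)$, i.e. $g(w) = f^{\alpha}(w) / \int f^{\alpha}(w)\, dw$. Since
$P_{F(w)}(T = i-1) \leq 1$ for all $i = 1, \ldots, n$ and fixed $w$, we have

\[ \log E\left[ \left\{ P_{F(W)}(T = i-1) \right\}^{\alpha} \right] \leq 0, \]

for all $\alpha > 1$. This results in a lower bound for the difference between the Renyi information of $\mathbf{X}_{RSS}$ and
$\mathbf{X}_{SRS}$ as $H_\alpha(\mathbf{X}_{RSS}) - H_\alpha(\mathbf{X}_{SRS}) \geq \frac{\alpha n}{1-\alpha} \log n$. In the following result, we find a sharper lower bound for
$H_\alpha(\mathbf{X}_{RSS}) - H_\alpha(\mathbf{X}_{SRS})$ when $\alpha > 1$.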



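Lemma 5. For any $\alpha > 1$ and all $n \geq 2$, we have $H_\alpha(\mathbf{X}_{RSS}) - H_\alpha(\mathbf{X}_{SRS}) \geq \Psi(\alpha, n)$, with

\[ \Psi(\alpha, n) = \frac{\alpha}{1-\alpha} \sum_{i=1}^{n} \log \left\{ n \binom{n-1}{i-1} \left( \frac{i-1}{n-1} \right)^{i-1} \left( \frac{n-i}{n-1} \right)^{n-i} \right\}, \]

where $\Psi(\alpha, n) \in \left[ \frac{\alpha n}{1-\alpha} \log n,\; 0 \right)$ and $0^0$ is read as 1.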



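Proof. Using the pdf of $X_{(i)}$ and the transformation $F(X) = U$, we have

\[ H_\alpha(\mathbf{X}_{RSS}) = \frac{1}{1-\alpha} \sum_{i=1}^{n} \log \int_{0}^{1} \left\{ f_{i,n-i+1}(u) \right\}^{\alpha} f^{\alpha-1}(F^{-1}(u))\, du, \tag{11} \]

where $f_{i,n-i+1}(u)$ is the pdf of a Beta$(i, n-i+1)$ random variable with its mode at $u^{*} = \frac{i-1}{n-1}$. Now, since
$f_{i,n-i+1}(u) \leq f_{i,n-i+1}(u^{*})$, we have

\[ \begin{aligned}
H_\alpha(\mathbf{X}_{RSS}) &\geq \frac{\alpha}{1-\alpha} \sum_{i=1}^{n} \log \left\{ n \binom{n-1}{i-1} \left( \frac{i-1}{n-1} \right)^{i-1} \left( \frac{n-i}{n-1} \right)^{n-i} \right\} + \frac{n}{1-\alpha} \log \int_{0}^{1} f^{\alpha-1}(F^{-1}(u))\, du \\
&= \Psi(\alpha, n) + H_\alpha(\mathbf{X}_{SRS}),
\end{aligned} \]

where

\[ \Psi(\alpha, n) = \frac{\alpha}{1-\alpha} \sum_{i=1}^{n} \log \left\{ n \binom{n-1}{i-1} \left( \frac{i-1}{n-1} \right)^{i-1} \left( \frac{n-i}{n-1} \right)^{n-i} \right\}. \tag{12} \]

It is easy to show that for all $n \geq 2$ and $\alpha > 1$, $\Psi(\alpha, n) < 0$. To do this, one can easily check that
$\Psi(\alpha, n+1) \leq \Psi(\alpha, n)$ for all $n \geq 2$ with $\Psi(\alpha, 2) = \frac{2\alpha}{1-\alpha} \log 2 < 0$. Also, since
$\binom{n-1}{i-1} \left( \frac{i-1}{n-1} \right)^{i-1} \left( \frac{n-i}{n-1} \right)^{n-i} \leq 1$ for all
$i = 1, \ldots, n$, we have $\Psi(\alpha, n) \geq \frac{\alpha}{1-\alpha} \sum_{i=1}^{n} \log n = \frac{\alpha n}{1-\alpha} \log n$. □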
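
To see how much Lemma 5 improves on the cruder bound $\frac{\alpha n}{1-\alpha} \log n$, both can be tabulated. The sketch below assumes the bound has the form $\frac{\alpha}{1-\alpha} \sum_{i=1}^{n} \log\{ n \binom{n-1}{i-1} (\frac{i-1}{n-1})^{i-1} (\frac{n-i}{n-1})^{n-i} \}$, i.e. $\alpha/(1-\alpha)$ times the sum of the log peak heights of the Beta$(i, n-i+1)$ densities; the function names `psi_bound` and `crude_bound` are illustrative.

```python
import math

def psi_bound(alpha, n):
    """Lower bound of Lemma 5 (alpha > 1, n >= 2)."""
    total = 0.0
    for i in range(1, n + 1):
        u_star = (i - 1) / (n - 1)        # mode of the Beta(i, n - i + 1) density
        # Density at its mode; Python evaluates 0.0 ** 0 as 1.0, as required here.
        peak = (n * math.comb(n - 1, i - 1)
                * u_star ** (i - 1) * (1.0 - u_star) ** (n - i))
        total += math.log(peak)
    return alpha / (1.0 - alpha) * total

def crude_bound(alpha, n):
    """The bound alpha * n * log(n) / (1 - alpha) obtained before Lemma 5."""
    return alpha * n / (1.0 - alpha) * math.log(n)

alpha = 2.0
for n in range(2, 8):
    print(n, round(crude_bound(alpha, n), 4), round(psi_bound(alpha, n), 4))
```

For $n = 2$ the two bounds coincide, since both Beta densities peak at height 2; for $n \geq 3$ the bound of Lemma 5 is strictly larger.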



4. Kullback-Leibler Information of Ranked Set Samples 

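In 1951 Kullback and Leibler introduced a measure of information from the statistical point of view by comparing
two probability distributions associated with the same experiment. The Kullback-Leibler (KL) divergence
is a measure of how different two probability distributions (over the same sample space) are. The KL divergence
for two random variables $X$ and $Y$ with cdfs $F$ and $G$ and pdfs $f$ and $g$, respectively, is given by

\[ K(X, Y) = \int f(t) \log \left( \frac{f(t)}{g(t)} \right) dt. \tag{13} \]

Using the same idea, we define the KL discrimination information between $\mathbf{X}_{SRS}$ and $\mathbf{X}_{RSS}$ as follows:

\[ K(\mathbf{X}_{SRS}, \mathbf{X}_{RSS}) = \int f_{\mathbf{X}_{SRS}}(\mathbf{x}) \log \left( \frac{f_{\mathbf{X}_{SRS}}(\mathbf{x})}{f_{\mathbf{X}_{RSS}}(\mathbf{x})} \right) d\mathbf{x}, \]

where $f_{\mathbf{X}_{SRS}}$ and $f_{\mathbf{X}_{RSS}}$ denote the joint pdfs of the two samples. It is easy to see that

\[ K(\mathbf{X}_{SRS}, \mathbf{X}_{RSS}) = \sum_{i=1}^{n} \int f(x) \log \left( \frac{f(x)}{f_{(i)}(x)} \right) dx = \sum_{i=1}^{n} K(X, X_{(i)}). \tag{14} \]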
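By substituting the pdf of $X_{(i)}$ in (14), we find

\[ \begin{aligned}
K(\mathbf{X}_{SRS}, \mathbf{X}_{RSS}) &= -\sum_{i=1}^{n} \int f(x) \log \left\{ n \binom{n-1}{i-1} F^{i-1}(x) \bar{F}^{n-i}(x) \right\} dx \\
&= -n \log n - \sum_{i=1}^{n} \log \binom{n-1}{i-1} + n(n-1) \\
&:= d_n.
\end{aligned} \tag{15} \]

Note that $K(\mathbf{X}_{SRS}, \mathbf{X}_{RSS})$ is distribution-free, and $\{d_n, n = 1, 2, \ldots\}$ is a nondecreasing sequence of non-negative
real values for all $n \in \mathbb{N}$. That is, the KL information between the distribution of SRS and the
distribution of RSS of the same size increases as the set size $n$ increases.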



Remark 1. It is well known that the KL divergence is non-symmetric and cannot be considered as a distance
metric. In our problem, note that

\[ K(\mathbf{X}_{RSS}, \mathbf{X}_{SRS}) = \sum_{i=1}^{n} K(X_{(i)}, X) = \sum_{i=1}^{n} K(U_{(i)}, U) = -\sum_{i=1}^{n} H(U_{(i)}) = -k(n), \]

where $U$ has uniform distribution. Various measures have been introduced in the literature generalizing this
measure. For example, in order to have a symmetric measure, the following symmetric Kullback-Leibler distance
(KLD) is proposed:

\[ \mathrm{KLD}(X, Y) = K(X, Y) + K(Y, X). \]
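
The asymmetry noted in Remark 1 can be made concrete by evaluating both directions of the divergence. The following sketch uses illustrative function names and only the Python standard library. It evaluates $d_n = -n \log n - \sum_{i=1}^{n} \log \binom{n-1}{i-1} + n(n-1)$ and obtains $-k(n)$ from the recursion for $k(n)$, written with the term $\log n!$ and started at $k(1) = 0$.

```python
import math

def kl_srs_to_rss(n):
    """d_n = K(X_SRS, X_RSS), the distribution-free value in (15)."""
    log_binoms = sum(math.log(math.comb(n - 1, i)) for i in range(n))
    return -n * math.log(n) - log_binoms + n * (n - 1)

def kl_rss_to_srs(n):
    """K(X_RSS, X_SRS) = -k(n), with k(j + 1) = k(j) + j + log(j!) - (j + 1) log(j + 1)."""
    k = 0.0
    for j in range(1, n):
        k += j + math.lgamma(j + 1) - (j + 1) * math.log(j + 1)
    return -k

def symmetric_kld(n):
    """KLD between the SRS and RSS sampling distributions."""
    return kl_srs_to_rss(n) + kl_rss_to_srs(n)

for n in range(1, 7):
    print(n, round(kl_srs_to_rss(n), 4), round(kl_rss_to_srs(n), 4),
          round(symmetric_kld(n), 4))
```

For $n = 2$ the two directions are $2 - 2 \log 2$ and $2 \log 2 - 1$, so they differ while their sum, the symmetric KLD, equals 1.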

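Lemma 6. Suppose $\mathbf{X}_{SRS}$ is a SRS of size $n$ from $f(x)$ and let $\mathbf{Y}_{RSS}$ and $\mathbf{Y}_{SRS}$ be independent RSS and
SRS samples of the same size from another distribution with pdf $g(x)$, respectively. Then,

\[ K(\mathbf{X}_{SRS}, \mathbf{Y}_{SRS}) \leq K(\mathbf{X}_{SRS}, \mathbf{Y}_{RSS}). \]

Proof. To show the result, by the use of the fact that $g(x) = \frac{1}{n} \sum_{i=1}^{n} g_{(i)}(x)$, we have

\[ \begin{aligned}
K(\mathbf{X}_{SRS}, \mathbf{Y}_{RSS}) &= \sum_{i=1}^{n} \int f(x) \log \left( \frac{f(x)}{g_{(i)}(x)} \right) dx \\
&\geq n \int f(x) \log \left( \frac{f(x)}{\frac{1}{n} \sum_{i=1}^{n} g_{(i)}(x)} \right) dx \\
&= n \int f(x) \log \left( \frac{f(x)}{g(x)} \right) dx = K(\mathbf{X}_{SRS}, \mathbf{Y}_{SRS}),
\end{aligned} \]

where the inequality is due to the convexity of $h(t) = -\log t$. □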
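Now, let $\mathbf{X}^{*}_{RSS} = \{X_{[i]}, i = 1, \ldots, n\}$ be an imperfect RSS of size $n$ from $f(x)$. Then

\[ \begin{aligned}
K(\mathbf{X}_{SRS}, \mathbf{X}^{*}_{RSS}) &= -\sum_{i=1}^{n} \int f(x) \log \left( \sum_{j=1}^{n} p_{i,j}\, n \binom{n-1}{j-1} F^{j-1}(x) \bar{F}^{n-j}(x) \right) dx \\
&= -n \log n - \sum_{i=1}^{n} \int_{0}^{1} \log \left( \sum_{j=1}^{n} p_{i,j} \binom{n-1}{j-1} u^{j-1} (1-u)^{n-j} \right) du.
\end{aligned} \]

Therefore, the KL discrimination information between the distributions of $\mathbf{X}_{SRS}$ and $\mathbf{X}^{*}_{RSS}$ is also distribution-free
and it is only a function of the set size $n$ and the ranking error probabilities $p_{i,j} = P(X_{[i]} = X_{(j)})$.

In the following lemma, we show that the KL information between the distribution of a SRS and a perfect
RSS of the same size is greater than the one with imperfect RSS.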

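Lemma 7. Suppose $\mathbf{X}_{SRS}$ is a SRS of size $n$ from the pdf $f(x)$ and denote $\mathbf{X}_{RSS}$ and $\mathbf{X}^{*}_{RSS}$ as independent
perfect and imperfect RSS data of the same size from $f$, respectively. Then,

\[ K(\mathbf{X}_{SRS}, \mathbf{X}^{*}_{RSS}) \leq K(\mathbf{X}_{SRS}, \mathbf{X}_{RSS}). \]

Proof. To show the result note that

\[ \begin{aligned}
K(\mathbf{X}_{SRS}, \mathbf{X}^{*}_{RSS}) &= -n \log n - \sum_{i=1}^{n} \int_{0}^{1} \log \left( \sum_{j=1}^{n} p_{i,j} \binom{n-1}{j-1} u^{j-1} (1-u)^{n-j} \right) du \\
&\leq -n \log n - \sum_{i=1}^{n} \sum_{j=1}^{n} p_{i,j} \int_{0}^{1} \log \left( \binom{n-1}{j-1} u^{j-1} (1-u)^{n-j} \right) du \\
&= -n \log n - \sum_{j=1}^{n} \int_{0}^{1} \log \left( \binom{n-1}{j-1} u^{j-1} (1-u)^{n-j} \right) du \\
&= K(\mathbf{X}_{SRS}, \mathbf{X}_{RSS}),
\end{aligned} \]

where the inequality uses the concavity of $\log t$ and the last step uses $\sum_{i=1}^{n} p_{i,j} = 1$, which completes the proof. □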
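
Lemma 7 can be checked numerically for a given ranking error matrix. The sketch below is illustrative: the function name and the example matrices are assumptions made here, the matrix `P` is taken to be doubly stochastic with entries $p_{i,j}$, and the integral over $(0, 1)$ of $\log \sum_j p_{i,j} \binom{n-1}{j-1} u^{j-1} (1-u)^{n-j}$ is approximated by a midpoint rule.

```python
import math

def kl_srs_to_imperfect_rss(P, steps=20000):
    """K(X_SRS, X*_RSS) for an n x n doubly stochastic matrix P = (p_ij)."""
    n = len(P)
    binoms = [math.comb(n - 1, j) for j in range(n)]
    h = 1.0 / steps
    total = -n * math.log(n)
    for row in P:
        acc = 0.0
        for s in range(steps):
            u = (s + 0.5) * h             # midpoint of the s-th subinterval
            mix = sum(p * c * u ** j * (1.0 - u) ** (n - 1 - j)
                      for j, (p, c) in enumerate(zip(row, binoms)))
            acc += math.log(mix)
        total -= acc * h
    return total

perfect = [[1.0, 0.0], [0.0, 1.0]]        # perfect ranking, n = 2
noisy = [[0.8, 0.2], [0.2, 0.8]]          # hypothetical ranking error matrix
random_rank = [[0.5, 0.5], [0.5, 0.5]]    # ranking done at random
for P in (perfect, noisy, random_rank):
    print(round(kl_srs_to_imperfect_rss(P), 4))
```

With perfect ranking the value approximates $d_2 = 2 - 2 \log 2$, with random ranking it is zero, and the noisy matrix falls in between, in line with Lemma 7.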
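Another result which is of interest is to compare $K(\mathbf{X}_{RSS}, \mathbf{Y}_{RSS})$ with $K(\mathbf{X}_{SRS}, \mathbf{Y}_{SRS})$. To this end, we have

\[ \begin{aligned}
K(\mathbf{X}_{RSS}, \mathbf{Y}_{RSS}) &= \sum_{i=1}^{n} \int f_{(i)}(x) \left\{ \log \left( \frac{f(x)}{g(x)} \right) + \log \left( \frac{F^{i-1}(x) \bar{F}^{n-i}(x)}{G^{i-1}(x) \bar{G}^{n-i}(x)} \right) \right\} dx \\
&= K(\mathbf{X}_{SRS}, \mathbf{Y}_{SRS}) + A_n(F, G),
\end{aligned} \]

where

\[ A_n(F, G) = \sum_{i=1}^{n} \int f_{(i)}(x) \log \left( \frac{F^{i-1}(x) \bar{F}^{n-i}(x)}{G^{i-1}(x) \bar{G}^{n-i}(x)} \right) dx. \]

Here again $K(\mathbf{X}_{SRS}, \mathbf{Y}_{SRS}) \leq K(\mathbf{X}_{RSS}, \mathbf{Y}_{RSS})$ if $A_n(F, G) \geq 0$. Furthermore, it is easy to show that

\[ A_n(F, G) = -\frac{n(n-1)}{2} - n(n-1) \int_{0}^{1} \left\{ u \log G(F^{-1}(u)) + (1-u) \log \bar{G}(F^{-1}(u)) \right\} du. \]

Note that in this case $A_n(F, G)$ depends on the parent distributions of the $X$ and $Y$ samples.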

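Example 3. Suppose that $X$ and $Y$ have exponential distributions with parameters $\lambda_1$ and $\lambda_2$, and pdfs
$f(x) = \lambda_1 e^{-\lambda_1 x}$ and $g(y) = \lambda_2 e^{-\lambda_2 y}$, respectively. Then $\bar{G}(F^{-1}(u)) = (1-u)^{\lambda_2/\lambda_1}$ and

\[ A_n(F, G) = -\frac{n(n-1)}{2} - n(n-1) \int_{0}^{1} \left\{ u \log \left( 1 - (1-u)^{\lambda_2/\lambda_1} \right) + \frac{\lambda_2}{\lambda_1} (1-u) \log(1-u) \right\} du. \]

Writing $A_n(F, G) = n(n-1) \int f(x) \left\{ F(x) \log \frac{F(x)}{G(x)} + \bar{F}(x) \log \frac{\bar{F}(x)}{\bar{G}(x)} \right\} dx$, the term in braces is the KL
divergence between two Bernoulli distributions with success probabilities $F(x)$ and $G(x)$, so that $A_n(F, G) \geq 0$,
with equality when $\lambda_1 = \lambda_2$.

So, for the exponential distributions $K(\mathbf{X}_{SRS}, \mathbf{Y}_{SRS}) \leq K(\mathbf{X}_{RSS}, \mathbf{Y}_{RSS})$.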



5. Concluding Remarks 

In this paper, we have considered the information content of the perfect and imperfect RSS data using the
Shannon entropy, Renyi and Kullback-Leibler information measures. First, we have compared the Shannon
entropy of SRS data ($\mathbf{X}_{SRS}$) to the Shannon entropy of perfect RSS ($\mathbf{X}_{RSS}$) and imperfect RSS ($\mathbf{X}^{*}_{RSS}$) data of
the same size. In this case, we have established analytically that the Shannon entropies of $\mathbf{X}_{RSS}$ and $\mathbf{X}^{*}_{RSS}$
are less than the Shannon entropy of $\mathbf{X}_{SRS}$. We also showed that the Shannon entropy of the RSS data will
increase in the presence of ranking error. Next, we have established similar behaviour under the Renyi
information when $0 < \alpha < 1$, while the results for the case where $\alpha > 1$ remain unsolved. We conjectured that
similar results hold for the case where $\alpha > 1$ and provided examples to support the conjecture. Similar results
are obtained under the Kullback-Leibler information measure. The results of this paper show some desirable
properties of ranked set sampling compared with the commonly used simple random sampling in the context
of information theory.